Design and Implementation of WSRF-Compliant Grid Services for Mining Fuzzy Association Rules
نویسندگان
چکیده
Data mining is a widely used approach for the transformation of large amounts of data to useful patterns and knowledge. Fuzzy association rules mining is a data mining technique which tries to nd association rules without the e ect of sharp boundary problems when data contains continuous and categorical attributes. Grid data mining is a new concept, which allows the data mining process to be deployed and used in a data grid environment where data and service resources are geographically distributed. In this paper, a grid service for mining fuzzy association rules is developed. The service is implemented based on recently proposed Data Mining Grid Architecture (DMGA) and uses the Web Service Resource Framework (WSRF). Experimental evaluations, after implementing and deploying the service, show the e ectiveness and acceptable performance of the proposed grid service. Additionally, in this study, a new algorithm, namely FFDM, is developed to mine fuzzy association rules without raw data exchange, using the distributed storage of data grid environments. Empirical evaluation of FFDM reveals the scalability and e ciency of the proposed method, in addition to the advantages of minimum messaging and providing privacy of data.
منابع مشابه
The Weka4WS framework for distributed data mining in service-oriented Grids
The service oriented architecture (SOA) paradigm can be exploited for the implementation of data and knowledge-based applications in distributed environments. The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid services and applications. WSRF can be exploited for developing high-level services for distributed data mining applications. T...
متن کاملDistributed data mining services leveraging WSRF
The continuous increase of data volumes available from many sources raises new challenges for their effective understanding. Knowledge discovery in large data repositories involves processes and activities that are computational intensive, collaborative, and distributed in nature. The Grid is a profitable infrastructure that can be effectively exploited for handling distributed data mining and ...
متن کاملWeka4WS: A WSRF-Enabled Weka Toolkit for Distributed Data Mining on Grids
This paper presents Weka4WS, a framework that extends the Weka toolkit for supporting distributed data mining on Grid environments. Weka4WS adopts the emerging Web Services Resource Framework (WSRF) for accessing remote data mining algorithms and managing distributed computations. The Weka4WS user interface is a modified Weka Explorer environment that supports the execution of both local and re...
متن کاملWSRF Services for Composing Distributed Data Mining Applications on Grids: Functionality and Performance
The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid applications. WSRF can be exploited for developing high-level services for distributed data mining applications. This paper describes Weka4WS, a framework that extends the widely-used Weka toolkit for supporting distributed data mining on WSRF-enabled Grids. Weka4WS adopts the WSRF tec...
متن کاملDeveloping a Course Recommender by Combining Clustering and Fuzzy Association Rules
Each semester, students go through the process of selecting appropriate courses. It is difficult to find information about each course and ultimately make decisions. The objective of this paper is to design a course recommender model which takes student characteristics into account to recommend appropriate courses. The model uses clustering to identify students with similar interests and skills...
متن کامل